Method and system for synchroniziing presentation slide content with soundtrack

ABSTRACT

A method for synchronizing a plurality of presentation slide content with a soundtrack comprises obtaining the plurality of presentation slide content and the soundtrack including a plurality of audio samples. The presentation slide content comprises a video or an animation in the presentation slide. Each presentation slide content is associated with a metadata and each audio sample is indexed with a corresponding timecode. The method comprises detecting triggering event that identifies a current audio sample of the soundtrack as an audio sample to transition from a first presentation slide content to a second presentation slide content, and obtaining a timecode indexed with the identified audio sample, associating the timecode with the metadata of the second presentation slide content to link the second presentation slide content with the identified audio sample, and generating a synchronized presentation multimedia file having the linked second presentation slide content with the identified audio sample.

CROSS-REFERENCE To RELATED APPLICATIONS

The present application is a continuation of U.S. Non-Provisionalapplication Ser. No. 17/552,647, filed Dec. 16, 2021, entitled “Methodand System For Synchronizing Presentation Slide Content with aSoundtrack” which claims the benefit of U.S. Provisional Application No.63/151,407, filed Feb. 19, 2021, each of which is incorporated byreference herein in its entirety.

BACKGROUND OF THE INVENTION

In today's digital age, more and more businesses are utilizingmultimedia presentations to create tutorials, sales presentations, andthe like. Although, there are several tools to accomplish the same, theavailable mechanisms make the entire process of creating the multimediapresentations tedious, complex, and time-consuming. Moreover, theavailable tools do not provide any flexibility to the users and aredifficult to change.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

The accompanying figures, where like reference numerals refer toidentical or functionally similar elements throughout the separateviews, together with the detailed description below, are incorporated inand form part of the specification, and serve to further illustrateembodiments of concepts that include the claimed invention and explainvarious principles and advantages of those embodiments.

FIG. 1 illustrates an exemplary environment including a system forsynchronizing a plurality of presentation slide content with asoundtrack, in accordance with some embodiments;

FIG. 2 illustrates a block diagram of an exemplary user device, inaccordance with some embodiments;

FIG. 3 illustrates an exemplary method for synchronizing the pluralityof presentation slide content with the soundtrack, in accordance withsome embodiments;

FIG. 4 illustrates an exemplary method for editing the plurality ofpresentation slide content, in accordance with some embodiments; and

FIGS. 5 through 9 illustrate exemplary implementations for synchronizingthe plurality of presentation slide content with the soundtrack, inaccordance with some embodiments;

Skilled artisans will appreciate that elements in the figures areillustrated for simplicity and clarity and have not necessarily beendrawn to scale. For example, the dimensions of some of the elements inthe figures may be exaggerated relative to other elements to help toimprove understanding of embodiments of the present invention.

The apparatus and method components have been represented whereappropriate by conventional symbols in the drawings, showing only thosespecific details that are pertinent to understanding the embodiments ofthe present invention so as not to obscure the disclosure with detailsthat will be readily apparent to those of ordinary skill in the arthaving the benefit of the description herein.

DETAILED DESCRIPTION OF THE INVENTION

In one aspect, a method for synchronizing a plurality of presentationslide content in a presentation slide with a soundtrack is described.The method comprises obtaining the plurality of presentation slidecontent arranged in a predefined order in the presentation slide and thesoundtrack including a plurality of audio samples. The presentationslide content comprises a video in the presentation slide or ananimation in the presentation slide. Each presentation slide content isassociated with a metadata representing unique identification of thecorresponding presentation slide content and each audio sample isindexed with a corresponding timecode. The method further comprisesdetecting a triggering event identifying an audio sample from theplurality of audio samples of the soundtrack to transition from a firstpresentation slide content to a second presentation slide content of theplurality of presentation slide content in the presentation slide. Themethod further comprises obtaining a timecode indexed with theidentified audio sample of the soundtrack corresponding to the detectedtriggering event and associating the obtained timecode with the metadataof the second presentation slide content in the presentation slide tolink the second presentation slide content in the presentation slidewith the identified audio sample. Further, the method comprisesgenerating a synchronized presentation multimedia file having the linkedsecond presentation slide content in the presentation slide with theidentified. audio sample.

In another aspect, a system for synchronizing a plurality ofpresentation slide content in a presentation slide with a soundtrack isdescribed. The system comprises a memory and a processor communicativelycoupled to the memory. The memory stores instructions executable by theprocessor, and wherein upon execution of the stored instructions, theprocessor is configured to obtain the plurality of presentation slidecontent arranged in a predefined order in the presentation slide and thesoundtrack including a plurality of audio samples. The presentationslide content comprises a video in the presentation slide or ananimation in the presentation slide. Each presentation slide content isassociated with a metadata representing unique identification of thecorresponding presentation slide content and each audio sample isindexed with a corresponding timecode. The processor is furtherconfigured to detect a triggering event identifying an audio sample fromthe plurality of audio samples of the soundtrack to transition from afirst presentation slide content to a second presentation slide contentof the plurality of presentation slide content in the presentation slideand obtain a timecode indexed with the identified audio sample of thesoundtrack corresponding to the detected triggering event. The processoris further configured to associate the obtained timecode with themetadata of the second presentation slide content in the presentationslide to :link the second presentation slide content in the presentationslide with the identified audio sample and generate a synchronizedpresentation multimedia file having the linked second presentation slidecontent in the presentation slide with the identified audio sample.

In yet another aspect, a method fir synchronizing a plurality ofpresentation slide content with a soundtrack is described. The methodcomprises obtaining the plurality of presentation slide content arrangedin a predefined order and the soundtrack including a plurality of audiosamples. Each presentation slide content is associated with a metadatarepresenting unique identification of the corresponding presentationslide content and each audio sample is indexed with a correspondingtimecode. The method further comprises detecting a triggering eventidentifying an audio sample from the plurality of audio samples of thesoundtrack to transition from a first presentation slide content to asecond presentation slide content of the plurality of presentation slidecontent. The method further comprises obtaining a timecode indexed withthe identified audio sample of the soundtrack corresponding to thedetected triggering event and associating the obtained. timecode withthe metadata of the second presentation slide content to link the secondpresentation slide content with the identified audio sample. Further,the method comprises generating a synchronized presentation multimediafile having the linked second presentation slide content with theidentified audio sample. The method further comprises receiving one ormore edits to the second presentation slide content of the plurality ofpresentation slide content and updating the second presentation slidecontent to reflect the received one or more edits. Further, the methodcomprises replacing the corresponding second presentation slide contentin the obtained plurality of presentation slide content with the updatedsecond presentation slide content and detecting a further triggeringevent to associate the metadata of the replaced. second presentationslide content with the timecode of the identified audio sample of theplurality of audio samples of the soundtrack. The method furthercomprises associating the metadata of the replaced second presentationslide content with the timecode of the identified audio sample.

FIG. 1 illustrates an environment 100 implementing an exemplary system102 in accordance with various embodiments. The system 102 is configuredto synchronize a plurality of presentation slide content with asoundtrack to generate a synchronized presentation multimedia file. Forexample, the presentation slide content includes, but is not limited to,one or more of a presentation slide, a video in a presentation slide, ananimation in a presentation slide, or the like. The soundtrack includes,but is not limited to, an audio track, such as a narration, a song, orthe like, having a plurality of audio samples. In accordance withvarious embodiments, each audio sample corresponds to a unique portionof the soundtrack. Further, the synchronized presentation multimediafile corresponds to a multimedia file having one or more of a pluralityof presentation slide content synchronized with a corresponding uniqueaudio sample of the soundtrack.

In addition to the system 102, the environment 100 also includes one ormore user devices 104, such as, 104-1, 104-2 . . . 104-n, communicatingwith the system 102 over a communication network. Although not shown, aperson skilled in the art would appreciate that the communicationnetwork may include, but is not limited to, a wide area network (WAN)(for example, a transport control protocol/internet protocol (TCP/IP)based network), a cellular network, or a local area network (LAN)employing any of a variety of communications protocols as is now knownor in the future developed.

Each of the one or more user devices 104, such as, 104-1, 104-2 . . .104-n, operates as an interface for a corresponding user interactingwith the system 102. Each user utilizes the respective user device 104to provide one or more inputs, such as but not limited to, the pluralityof presentation slide content and the soundtrack, and receive one ormore outputs, such as, but not limited to, the synchronized presentationmultimedia file, from the system 102. Each user device 104 includes aplurality of electrical and electronic components, providing power,operational control, communication, and the like within the user device104. The various components of the user device 104 will now be describedhereinafter.

It should be appreciated by those of ordinary skill in the art that FIG.2 depicts the user device 104 in a simplified manner and a practicalembodiment may include additional components and suitably configuredlogic to support known or conventional operating features that are notdescribed in detail herein. It will further be appreciated by those ofordinary skill in the art that the user device 104 is a personalcomputer, desktop computer, tablet, smartphone, or any other computingdevice now known or developed in the future. It will further beappreciated by those of ordinary skill in the art that the user device104 alternatively may function within a remote server, cloud computingdevice, or any other local or remote computing mechanism now known ordeveloped in the future. Although the user device 104 is shown anddescribed to be implemented within a single computing device, it may becontemplated that the one or more components of the user device 104 mayalternatively be implemented in a distributed computing environment.

The user device 104 includes, among other components, a user devicetransceiver 106, a user device interface 108, a user device display 114,a user device processor 110, and a user device memory 112. Thecomponents of the user device 104, including the user device transceiver106, the user device interface 108, the user device display 114, theuser device processor 110, and the user device memory 112, cooperatewith one another to enable operations of the user device 104. Eachcomponent may communicate with one another via a local interface (notshown). The local interface may be, such as, but not limited to, one ormore buses or other wired or wireless connections, as is known in theart. The local interface may have additional elements, which are omittedfor simplicity, such as controllers, buffers (caches), drivers,repeaters, and receivers, among many others, to enable communications.Further, the local interface may include address, control, and/or dataconnections to enable appropriate communications among theaforementioned components.

As illustrated, the user device 104 in the exemplary embodiment includesthe user device transceiver 106 to provide the one or more inputs to andreceive the one or more outputs from other devices, such as, the system102. The user device transceiver 106 includes a transmitter circuitryand a receiver circuitry to enable the user device 104 to communicatedata to and acquire data from the system 102. In this regard, thetransmitter circuitry includes appropriate circuitry to provide the oneor more inputs, such as, but not limited to, the plurality ofpresentation slide content and the soundtrack, to the system 102.Similarly, the receiver circuitry includes appropriate circuitry toreceive the one or more outputs, such as, but not limited to, thesynchronized presentation multimedia file and one or more instructionsassociated with the generation of the synchronized presentationmultimedia file from the system 102. It will be appreciated by those ofordinary skill in the art that the user device 104 may include a singleuser device transceiver 106 as shown, or alternatively separatetransmitting and receiving components, for example but not limited to, atransmitter, a transmitting antenna, a receiver, and a receivingantenna.

The user device interface 108 is configured to receive user input fromand/or to provide system output to the user or to one or more devices orcomponents. User input may be provided via a keyboard, a touch screendisplay (such as, the user device display 114), a camera, a touch pad, amicrophone, a recorder, a mouse or any other user input mechanism nowknown or developed in the future. System output may be provided via adisplay device, such as the user device display 114, speakers, aspeaker, a haptic output, or any other output mechanism now known ordeveloped in the future. The user device interface 108 may furtherinclude a serial port, a parallel port, an infrared (IR) interface, auniversal serial bus (USB) interface and/or any other interface hereinknown or developed in the future.

In some embodiments, the user device interface 108 includes a userdevice graphical user interface 116 through which the user communicatesto and from the system 102. The user device graphical user interface 116may be an application or web portal or any other suitable interface togenerate the synchronized presentation multimedia file now known ordeveloped in the future. The user device graphical user interface 116includes one or more of graphical elements associated with thegeneration of the synchronized presentation multimedia file. Thegraphical elements may include, but not limited to one or more ofgraphical icons, control buttons, timelines, selection boxes, progressindicators, pull-down menus, on-off checkmarks, scroll bars, windows,window edges, toggle buttons, and/or forms. The graphical elements maybe used in conjunction with text to prompt the user for an input,respond to user actions, or display information to the user in responseto the one or more instructions from the system 102.

The user device display 114 is configured to display data, images, andthe like. The user device display 114 includes a display screen or acomputer monitor now known or in the future developed. In accordancewith some embodiments, the user device display 114 is configured todisplay the user device graphical user interface 116 associated with thegeneration of the synchronized presentation multimedia file.

The user device memory 112 is a non-transitory memory configured tostore a set of instructions that are executable by the user deviceprocessor 110 to perform predetermined operations. For example, the userdevice memory 112 may include any of the volatile memory elements (forexample, random access memory (RAM), nonvolatile memory elements (forexample, read only memory (ROM), and combinations thereof. Moreover, theuser device memory 112 may incorporate electronic, magnetic, optical,and/or other types of storage media. In some embodiments, the userdevice memory 112 is also configured to store files, such as but notlimited to, the plurality of presentation slide content, the soundtrack,and the synchronized presentation multimedia file.

The user device processor 110 is configured to execute the instructionsstored in the user device memory 112 to perform the predeterminedoperations, for example, the detailed functions of the user device 104as will be described hereinafter. The user device processor 110 mayinclude one or more microprocessors, microcontrollers, DSPs (digitalsignal processors), state machines, logic circuitry, or any other deviceor devices that process information or signals based on operational orprogramming instructions. The user device processor 110 may beimplemented using one or more controller technologies, such asApplication Specific Integrated Circuit (ASIC), Reduced Instruction SetComputing (RISC) technology, Complex Instruction Set Computing (CISC)technology, or any other similar technology now known or in the futuredeveloped. The user device processor 110 is configured to cooperate withother components of the user device 104 to perform operations pursuantto communications and the one or more instructions from the system 102.

Referring back to FIG. 1, the system 102, the system 102 is configuredto obtain the plurality of presentation slide content and the soundtrackfrom the user device 104 and generate the synchronized presentationmultimedia file, The system 102 may be electrically and/orcommunicatively connected to a variety of devices, such as, the userdevice 104. In some embodiments, the system 102 includes a plurality ofelectrical and electronic components, providing power, operationalcontrol, communication, and the like within the system 102. For example,the system 102 includes, among other things, a system transceiver 120, asystem processor 124, and a system memory 126.

It should be appreciated by those of ordinary skill in the art that FIG.1 depicts the system 102 in a simplified manner and a practicalembodiment may include additional components and suitably configuredlogic to support known or conventional operating features that are notdescribed in detail herein. It will further be appreciated by those ofordinary skill in the art that the system 102 may be a personalcomputer, desktop computer, tablet, smartphone, or any other computingdevice now known or developed in the future.

Further, although the system 102 is shown and described to beimplemented within a single computing device, it may be contemplatedthat the one or more components of the system 102 may alternatively beimplemented in a distributed computing environment, without deviatingfrom the scope of the claimed subject matter. It will further beappreciated by those of ordinary skill in the art that the system 102alternatively may function within a remote server, cloud computingdevice, or any other remote computing mechanism now known or developedin the future. Such as, the system 102 in some embodiments may be acloud environment incorporating the operations of the system transceiver120, the system processor 124, and the system memory 126, and variousother operating modules to serve as a software as a service model forthe user devices 104. In some embodiments, the system 102 and the userdevice 104 may be implemented in a single device to perform theoperations of both the system 102 and the user device 104, as will bedescribed hereinafter.

The components of the system 102, including the system transceiver 120,the system processor 124, and the system memory 126 may communicate withone another via a local interface (not shown). The local interface maybe, namely, but not limited to, one or more buses or other wired orwireless connections, as is known in the art. The local interface mayhave additional elements, but not limited to, such as controllers,buffers (caches), drivers, repeaters, and receivers, among many others,to enable communications. Further, the local interface may includeaddress, control, and/or data connections to enable appropriatecommunications among the aforementioned components.

The system transceiver 120 includes a transmitter circuitry and areceiver circuitry (not shown) to enable the system 102 to communicatedata to and acquire data from other devices such as the user devices104. In this regard, the transmitter circuitry may include appropriatecircuitry to transmit data such as, but not limited to, the synchronizedpresentation multimedia files and the one or more instructionsassociated with the generation of the synchronized presentationmultimedia file to the user devices 104. Similarly, the receivercircuitry may include appropriate circuitry to receive data such as, butnot limited to, the plurality of presentation slide content and thesoundtrack from the user devices 104. The transmitter circuitry and thereceiver circuitry together form a wireless transceiver to enablewireless communication with the user device 104. It will be appreciatedby those of ordinary skill in the art that the system 102 may include asingle system transceiver 120 as shown, or alternatively separatetransmitting and receiving components, for example but not limited to, atransmitter, a transmitting antenna, a receiver, and a receivingantenna.

The system memory 126 is a non-transitory memory configured to store aset of instructions that are executable by the system processor 124 toperform the predetermined operations. For example, the system memory 126may include any of the volatile memory elements (for example, randomaccess memory (RAM), nonvolatile memory elements (for example read onlymemory (ROM), and combinations thereof. Moreover, the system memory 126may incorporate electronic, magnetic, optical, and/or other types ofstorage media. In some embodiments, the system memory 126 is alsoconfigured to store files, such as, but not limited to, the plurality ofpresentation slide content, the soundtrack, and the synchronizedpresentation multimedia file.

The system processor 124 is configured to execute the instructionsstored in the system memory 126 to perform the predetermined operations,for example, the detailed functions of the system 102 as will bedescribed hereinafter. The system processor 124 may include one or moremicroprocessors, microcontrollers, DSPs (digital signal processors),state machines, logic circuitry, or any other device or devices thatprocess information or signals based on operational or programminginstructions. The system processor 124 may be implemented using one ormore controller technologies, such as Application Specific IntegratedCircuit (ASIC), Reduced Instruction Set Computing (RISC) technology,Complex Instruction Set Computing (CISC) technology, or any othertechnology now known or in the future developed.

In operation, the system 102 is configured to obtain the plurality ofpresentation slide content. In an embodiment, the system 102 maytransmit a request to the user device 104 to provide the plurality ofpresentation slide content to the system 102. In an exemplaryembodiment, the request may also include one or more instructions toprompt the user, for example via one or more graphical elements on theuser device graphical user interface 116, to provide the plurality ofpresentation slide content. Upon receiving the request, the user device104 is configured to obtain the plurality of presentation slide content.In an exemplary embodiment, the user device 104 is configured to obtainthe plurality of presentation slide content in response to the userselecting the one or more graphical elements on the user devicegraphical user interface 116. The user device 104 is further configuredto provide the obtained plurality of presentation slide content to thesystem 102. In another embodiment, the system 102 is configured toreceive a signal from the user device 104 notifying transmission of theplurality of presentation slide content from the user device 104 to thesystem 102. Upon receiving the signal, the system 102 is configured toobtain the plurality of presentation slide content from the user device104. In accordance with various embodiments, the user device 104 isconfigured to obtain the plurality of presentation slide content fromthe user device memory 112, based on the selection of the user. In someembodiments, the user device 104 is configured to obtain the pluralityof presentation slide content from other devices and store the obtainedplurality of presentation slide content in the user device memory 112.In some other embodiments, the user device 104 is configured to obtainthe plurality of presentation slide content from, for example, apresentation application, via which the presentation slide content iscreated and store the created plurality of presentation slide content inthe user device memory 112. The user device 104 is further configured toprovide the plurality of presentation slide content to the system 102.

In some embodiments, the plurality of presentation slide content isobtained in a predefined order. In accordance with various embodiments,the predefined order corresponds to a sequence for viewing the pluralityof presentation slide content in the synchronized presentationmultimedia file. In an embodiment, the system 102 may provide one ormore instructions to the user device 104 to provide the plurality ofpresentation slide content in the predefined order to the system 102.For example, in an embodiment, the one or more instructions may includeinstructions to prompt the user, for example, via one or more graphicalelements on the user device graphical user interface 116, to provide theplurality of presentation slide content in the predefined order. Uponreceiving the instructions, the user device 104 is configured to obtainthe plurality of presentation slide content arranged in the predefinedorder. In some embodiments, the user device 104 may obtain the pluralityof presentation slide content arranged in the predefined order from thepresentation application, on which the presentation slide content isarranged in the predefined order. In some embodiments, the user device104 is configured to obtain the plurality of presentation slide contentarranged in the predefined order from other devices. The obtainedplurality of presentation slide content arranged in the predefined ordermay be stored in the user device memory 112. The user device 104 isconfigured to obtain the plurality of presentation slide contentarranged in the predefined order from the user device memory 112 basedon inputs from the user. The user device 104 is further configured totransmit the plurality of presentation slide content arranged in thepredefined order to the system 102. In another embodiment, the system102 is configured to receive a signal from the user device 104 notifyingtransmission of the plurality of presentation slide content arranged inthe predefined media order from the user device 104 to the system 102.Upon receiving the signal, the system 102 is configured to obtain theplurality of presentation slide content arranged in the predefined mediaorder from the user device 104. In some embodiments, the system 102 isconfigured to provide one or more instructions to the user device 104 todisplay one or more of the obtained plurality of presentation slidecontent in the predefined order on the user device graphical userinterface 116 of the user device 104.

In accordance with various embodiments, each of the plurality ofpresentation slide content is associated with a meta.data representingunique identification of the corresponding presentation slide content.For instance, the metadata includes, but is not limited to, a serialnumber, a content number, a slide number, and/or any uniqueidentification, of the corresponding presentation slide content. Inaccordance with some embodiments, the system 102 is configured to obtainthe metadata corresponding to each of the plurality of presentationslide content from the user device 104. To this end, the system 102 mayprovide a request to the user device 104 to provide the metadatacorresponding to each of the plurality of presentation slide content.Upon receiving the request, the user device 104 is configured to obtainthe metadata corresponding to each of the plurality of presentationslide content from the user device memory 112. In another embodiment,the system 102 is configured to receive a signal from the user device104 notifying transmission of the metadata corresponding to each of theplurality of presentation slide content from the user device 104 to thesystem 102. Upon receiving the signal, the system 102 is configured toobtain the metadata corresponding to each of the plurality ofpresentation slide content from the user device 104.

In accordance with some embodiments, the user device 104 is configured.to obtain the metadata corresponding to each of the plurality ofpresentation slide content from the presentation application, when theplurality of presentation slide content is created using thepresentation application. In such cases, the user device 104, forexample, the presentation application, is configured to generate themetadata corresponding to each of the plurality of presentation slidecontent. In some other embodiments, the user device 104 is configured toreceive the metadata corresponding to each of the plurality ofpresentation slide content from other devices, when the plurality ofpresentation slide content is received by the user device 104 from theother device. The received metadata is then stored in the user devicememory 112. The user device 104 is configured to obtain the metadatacorresponding to each of the plurality of presentation slide contentfrom the user device memory 112. The user device 104 is furtherconfigured to transmit the metadata corresponding to each of theplurality of presentation slide content to the system 102.

Although not described in detail, it will be appreciated that the system102 may be configured to obtain the plurality of presentation slidecontent and the associated metadata from two or more user devices 104 toenable the respective two or more users to collaborate for creation ofthe synchronized presentation multimedia file. For example, a firstse(of plurality of presentation slide content and the metadatacorresponding to the first set of plurality of presentation slidecontent may be obtained from a first user device 104-1 and a second setof set of plurality of presentation slide content and the metadatacorresponding to the second set of set of plurality of presentationslide content may be obtained from a second user device 104-2,

The system 102 is configured to obtain the soundtrack including aplurality of audio samples from the user device 104. In accordance withvarious embodiment, each audio sample may correspond to a unique portionof the soundtrack. In an embodiment, the system 102 may transmit arequest to the user device 104 to provide the soundtrack to the system102. In an exemplary embodiment, the request may also include one ormore instructions to prompt the user, for example, via one or moregraphical elements on the user device graphical user interface 116, toprovide the soundtrack. Upon receiving the request, the user device 104is configured to obtain the soundtrack from the user. In an exemplaryembodiment, the user device 104 is configured to obtain the soundtrackwhen the user selects the one or more graphical elements on the userdevice graphical user interface 116. In another embodiment, the system102 is configured to receive a signal from the user device 104 notifyingtransmission of the one or more soundtracks from the user device 104 tothe system 102. Upon receiving the signal, the system 102 is configuredto obtain the one or more soundtracks from the user device 104. in anembodiment, the system 102 is configured to receive one or more musictracks to merge the received one or more music tracks with the obtainedsoundtrack.

In some embodiments, the user device 104 is configured to record thesoundtrack. To this end, the system 102 is configured to instruct theuser device 104 to display one or more graphical elements on the userdevice graphical user interface 116 to determine whether the userchooses to record the soundtrack. The system 102 is configured todetermine that the user chooses to record the soundtrack, when the userselects the displayed one or more graphical elements on the user devicegraphical user interface 116. Upon receiving the selection, in variousembodiments, the system 102 is configured to instruct the user device104 to allow the user to record the soundtrack via one or more userdevice interface 108, for example, via a microphone or an audio recorderwithin or communicatively coupled to the user device 104. In accordancewith some embodiments, the recording of the soundtrack may be performedusing a teleprompter. To this end, the system 102 may provide one ormore instructions to the user device 104 to prompt the user, forexample, via one or more graphical elements on the user device graphicaluser interface 116, to import a text file comprising narration for thesoundtrack. Upon receiving the request, the user device 104 isconfigured to import the text file, for example, stored in the userdevice memory 112. In an exemplary embodiment, the user device 104 isconfigured to import the text file when the user selects the one or moregraphical elements on the user device graphical user interface 116. Inan exemplary embodiment, the imported text file is displayed via one ormore graphical elements such as a teleprompter on the user devicegraphical user interface 116. The system 102 is configured to instructthe user device 104 via one or more graphical elements on the userdevice graphical user interface 116 to record the narration displayed onthe teleprompter via the microphone or the recorder within orcommunicatively coupled to the user device 104. In accordance with anembodiment, the system 102 is configured to provide one or moreinstructions to the user device 104 to re-record the narration, via oneor more graphical elements if the user chooses to re-record a portion ofthe narration. Upon receiving the one or more instructions, the userdevice 104 allows the user to re-record the portion of the narration. Inyet some other embodiments, the user device 104 is configured to obtainthe soundtrack from other devices. In the above discussed cases, theuser device 104 is configured to store the soundtrack in the user devicememory 112. The user device 104 is configured to obtain the soundtrackfrom the user device memory 112, based on the selection of the user andprovide the obtained soundtrack to the system 102. In some embodiments,the obtained soundtrack may include multiple soundtracks mixed together.In some embodiments, the system 102 is configured to provide one or moreinstructions to the user device 104 to display a waveform associatedwith the obtained soundtrack on the user device graphical user interface116 of the user device 104.

In accordance with various embodiments, each of the plurality of audiosamples is indexed with a corresponding timecode. The timecode may beany standard timecode, such as Society of Motion Picture and TelevisionEngineers (SMPTE) time code or the like. In some embodiments, the system102 is configured to obtain the timecode associated with each audiosample from the user device 104. In an embodiment, the system 102 mayprovide a request to the user device 104 to provide the timecodeassociated with each audio sample to the system 102. In anotherembodiment, the system 102 is configured to receive a signal from theuser device 104 notifying transmission of the timecode associated witheach audio sample from the user device 104 to the system 102. Uponreceiving the signal, the system 102 is configured to obtain thetimecode associated with each audio sample from the user device 104. Inaccordance with some embodiments, when the user device 104 records thesoundtrack, the user device 104 is configured to index each audio sampleof the soundtrack with the timecode. In some other embodiments, when theuser device 104 receives the soundtrack from other devices, the userdevice 104 is configured to receive the indexed timecode correspondingto each audio sample. In both cases, the user device 104 is configuredto store the indexed timecodes in the user device memory 112. The userdevice 104 is further configured to obtain the timecode associated witheach audio sample from the user device memory 112 and transmit thetimecode to the system 102.

The system 102 is further configured to detect a triggering eventidentifying an audio sample from the plurality of audio samples of thesoundtrack to transition from a first presentation slide content to asecond presentation slide content of the plurality of presentation slidecontent. In accordance with an embodiment, the triggering event mayinclude one or more textual marks in a transcribed text file of thesoundtrack provided by the user. In some other embodiments, thetriggering event may include one of a mouse click, a key press, a screentouch, or any trigger now known or developed in the future.

In accordance with an embodiment, the system 102 is configured totransmit one or more instructions to the user device 104 to determinewhether the user chooses to add the one or more textual marks in thetranscribed text file of the soundtrack. In particular, the one or moreinstructions may include instructions to display one or more graphicalelements on the user device graphical user interface 116 to determinewhether the user chooses to add the one or more textual marks in thetranscribed text file of the soundtrack. The system 102 is configured todetermine that the user chooses to add the one or more textual marks inthe transcribed text file, when the user selects the displayed one ormore graphical elements on the user device graphical user interface 116.

Upon receiving the selection, the system 102, for example, via thesystem processor 124, is configured to transcribe the obtainedsoundtrack into a text file, to obtain the transcribed text file of thesoundtrack. The system 102 is configured to instruct the user device 104to display the transcribed text file on the user device graphical userinterface 116. In accordance with an embodiment, the system 102 mayprovide one or more instructions to the user device 104 to prompt theuser, for example, via one or more graphical elements, on the userdevice graphical user interface 116, to add one or more textual marks inthe displayed transcribed text file. The one or more textual marks maybe any textual symbol or a special character, including but not limitedto, an asterisk (*), an exclamation mark (!), a hash (#) or any othermark now know or in the future developed . Upon receiving the one ormore instructions, the user device 104 is configured to prompt the user,via the one or more graphical elements on the user device graphical userinterface 116, to add the one or more textual marks in the transcribedtext file. In some embodiments, the user device 104 is configured toenable the user to add the one or more textual marks, for example, viathe user device interface 108. The user device 104 is configured totransmit the transcribed text file having the one or more textual marksto the system 102.

In accordance with various embodiments, the system 102 is configured toconsider the addition of one or more textual marks in the transcribedtext file as a trigger that the user chooses to initiate thesynchronization process (as described in detail hereinafter) ofsynchronizing the plurality of presentation slide content with thesoundtrack. To this end, the system 102 is configured to detect thetriggering event, identifying an audio sample from the plurality ofaudio samples of the soundtrack, to transition from a first presentationslide content to a second presentation slide content of the plurality ofpresentation slide content. In accordance with the various embodiments,the system 102 is considered to identify the first presentation slidecontent and the second presentation slide content based on the stage ofthe synchronization process, as described in detail hereinafter. Forexample, at the start of the synchronization process, the firstpresentation slide content may correspond to a presentation slidecontent arranged at a first position in the predefined order and thesecond presentation slide content may be a subsequent presentation slidecontent at a second position in the predefined order.

In accordance with various embodiments, the one or more textual marks inthe transcribed text file is added in proximity to a word in thetranscribed text file, where the user chooses to transition from thefirst presentation slide content to the second presentation slidecontent. In an exemplary embodiment, the one or more textual marks maybe added before a word in the transcribed text file to indicate thetransition from the first presentation slide content to the secondpresentation slide content. In yet another embodiment the one or moretextual marks may be added after a word in the transcribed text file toindicate the transition from the first presentation slide content to thesecond presentation slide content. The system 102 is further configuredto identify an audio sample of the soundtrack corresponding to the word(in proximity to which the one or more textual marks has been added) inthe transcribed text file. In an exemplary embodiment, the system 102may be configured to identify the audio sample based on the transcribedtext file of the soundtrack and the soundtrack. For example, the system102 may be configured to superimpose the transcribed text file over thewaveform of the soundtrack such that each word in the transcribed textfile is superimposed on the corresponding audio section of thesoundtrack. The system 102 is configured to identify an audio samplecorresponding to the position of the one or more textual marks in thesuperimposed file as the audio sample to the transition from the firstpresentation slide content to the second presentation slide content. Inaccordance with various embodiments, the system 102 is configured toidentify the audio samples corresponding to each textual mark in thetranscribed text file. In an exemplary embodiment, the system 102 isconfigured to store the timecode of the identified audio samplecorresponding to the one or more textual marks.

In an exemplary embodiment, at the start of the synchronization process,when the system 102 obtains the transcribed text file having the one ormore textual marks, the system 102 is configured to detect a firsttextual mark in the transcribed text file and identify the first textualmark as a first triggering event to transition from the presentationslide content arranged at the first position in the predefined order(the first presentation slide content, in this case) to the presentationslide content arranged at the second position in the predefined order(the second presentation slide content, in this case). To this end, thesystem 102 is configured to identify a first audio sample, for example,audio sample one (“1”), in the soundtrack corresponding to the positionof the first textual mark in the superimposed file as the audio sampleto the transition from the presentation slide content arranged at thefirst position in the predefined order (the first presentation slidecontent, in this case) to the presentation slide content arranged at thesecond position in the predefined order (the second presentation slidecontent, in this case).

The system 102 is further configured to detect a second textual mark inthe transcribed text file and identify the second textual mark as asecond triggering event. At this stage, the system 102 is configured toidentify the first presentation slide content as the presentation slidecontent arranged at the second position in the predefined order and thesecond presentation slide content as a presentation slide contentarranged at the third position in the predefined media order. The system102 is configured to detect the second triggering event as a trigger totransition from the presentation slide content arranged at the secondposition in the predefined order (the first presentation slide content,in this case) to the presentation slide content arranged at the thirdposition in the predefined order (the second presentation slide content,in this case). The system 102 is configured to identify a second audiosample, for example, audio sample two (“2”), in the soundtrackcorresponding to the position of the second textual mark in thesuperimposed file as the audio sample to the transition from thepresentation slide content arranged at the second position in thepredefined order (the first presentation slide content, in this case) tothe presentation slide content arranged at the third position in thepredefined order (the second presentation slide content, in this case).In accordance with various embodiments, the system 102 is configured toperform the above step until audio samples corresponding to each textualmark in the transcribed text file are identified.

In some embodiments, the system 102 is configured to receive one or moreedits to the transcribed text file associated with the soundtrack, forexample, via the system transceiver 120. The one or more edits maycorrespond to one or more of deleting one or more textual marks, addingone or more textual marks, deleting or adding extra spaces in thetranscribed text file, deleting extra words in the transcribed textfile, or the like. To this end, the system 102 is configured to provideone or more instructions to the user device 104 to allow the user toedit transcribed text file on the user device 104, in particular, viaone or more graphical elements. For example, the system 102 may instructthe user device 104 to display the transcribed text file, via the systemgraphical user interface 118 to enable the user to edit the transcribedtext file. Upon receiving the instructions, the user device 104 isconfigured to receive the one or more edits to the transcribed text filefrom the user for example, via the user device graphical user interface116. Upon receiving the one or more edits, the user device 104 isfurther configured to transmit the transcribed text file reflecting theone or more edits, to the system 102. In an embodiment, the system 102is configured to replace the transcribed file with the updatedtranscribed file in the plurality of presentation slide content saved inthe system memory 126, to reflect the received one or more edits. Thesystem 102 is further configured to instruct the user device 104 todisplay the updated transcribed file to the user.

In some other embodiments, the system 102 is configured to detect thetriggering event in real time identifying an audio sample from theplurality of audio samples of the soundtrack to transition from thefirst presentation slide content to the second presentation slidecontent of the plurality of presentation slide content, while thesoundtrack is being played. In such cases, the triggering event mayinclude one of a mouse click, a key press, a screen touch, or the like,while the soundtrack is being played. To this end, the system 102 isconfigured to determine whether a soundtrack is being played on the userdevice 104. In some embodiments, the system 102 is configured tocommunicate with the user device 104 to determine whether the soundtrackis being played on the user device 104. In an exemplary embodiment, thesystem 102 is configured to instruct the user device 104 to provide userrequest associated with the playback of the soundtrack to the system102. In accordance with various embodiments, the user request mayinclude selection of one or more graphical elements, on the user devicegraphical user interface 116, associated with the playback of thesoundtrack. Upon receiving the user request, the user device 104 isconfigured to provide the user request to the system 102. The system 102is then configured to control the user device 104 to play thesoundtrack, for example, via the speaker within or communicativelycoupled to the user device 104, In accordance with various embodiments,the system 102 is configured to communicate with the user device 104 todetermine that the soundtrack is being played when the user device 104starts playing the soundtrack.

In such cases, the system 102 is configured to consider the user requestto play the soundtrack as a trigger that the user chooses to initiatethe synchronization process. To this end, the system 102 is configuredto instruct the user device 104 to display one or more presentationslide content, from the plurality of presentation slide content, forexample, via the user device graphical user interface 116 on the userdevice 104, when the soundtrack is being played. In an exemplaryembodiment, the one or more presentation slide content may include thefirst presentation slide content and the second presentation slidecontent. As discussed above, the system 102 is considered to identifythe first presentation slide content and the second presentation slidecontent based on the stage of the synchronization process.

In an exemplary embodiment, at the start of the synchronization process,when the first presentation slide content corresponds to the filearranged at the first position in the predefined order and the secondpresentation slide content corresponds to the subsequent file arrangedat the second position in the predefined order, the system 102 isconfigured to detect a first triggering event as a trigger to transitionfrom the presentation slide content arranged at the first position inthe predefined order (the first presentation slide content, in thiscase) to the presentation slide content arranged at the second positionin the predefined order (the second presentation slide content, in thiscase). To this end, the system 102 is configured to identify a firstaudio sample, for example, audio sample one (“1”), in the soundtrackthat is being played at the time of receiving the first triggering eventas an audio sample to the transition from the presentation slide contentarranged at the first position in the predefined order (the firstpresentation slide content, in this case) to the presentation slidecontent arranged at the second position in the predefined order (thesecond presentation slide content, in this case).

The system 102 is further configured to detect a second triggering eventwhile the soundtrack is being played. At this stage, the system 102 isconfigured to identify the first presentation slide content as thepresentation slide content arranged at the second position in thepredefined order and the second presentation slide content as apresentation slide content arranged at the third position in thepredefined media order. The system 102 is then configured to detect thesecond triggering event as a trigger to transition from the presentationslide content arranged at the second position in the predefined order(the first presentation slide content, in this case) to the presentationslide content arranged at the third position in the predefined order(the second presentation slide content, in this case). The system 102 isconfigured to identify a second audio sample, for example, audio sampletwo (“2”), in the soundtrack that was being played at the time ofreceiving the second triggering event as an audio sample to thetransition from the presentation slide content arranged at the secondposition in the predefined order (the first presentation slide content,in this case) to the presentation slide content arranged at the thirdposition in the predefined order (the second presentation slide content,in this case). In accordance with various embodiments, the system 102 isconfigured to perform the above step until audio samples correspondingto each triggering event are identified.

In accordance with various embodiments, the system 102, for example, viathe system processor 124 is configured to obtain the timecode indexedwith the identified audio sample of the soundtrack corresponding to thedetected triggering event. The system 102 is also configured to obtainthe metadata of the second presentation slide content displayed on theuser device 104 when the triggering event is detected. In particular,the system 102 may be configured to obtain the timecode indexed with theidentified audio sample and the metadata of the second presentationslide content from the system memory 126. In accordance with variousembodiments, the system 102, for example, via the system processor 124,is configured to associate the obtained timecode with the metadata ofthe second presentation slide content to link the second presentationslide content with the identified audio sample. In an embodiment, thesystem 102 is configured to associate the obtained timecode with themetadata of the second presentation slide content by reading theobtained timecode and writing the metadata, for example, the slidenumber, directly on to the obtained timecode. In some embodiments, thesystem 102 is configured to store the association in the system memory126. In accordance with some embodiments, the system 102 is configuredto associate the obtained timecode with the metadata of the secondpresentation slide content periodically at predefined intervals, upondetection of the triggering event.

In accordance with various embodiments, the system 102 is configured toreceive input to manually adjust the obtained timecode. To this end, thesystem 102 is configured to determine whether the user chooses tomanually adjust the obtained timecode. For example, the system 102 isconfigured to instruct the user device 104 to display one or moregraphical elements on the user device graphical user interface 116 todetermine whether the user chooses to manually adjust the obtainedtimecode. In an exemplary embodiment, the system 102 is configured todetermine that the user chooses to manually adjust the obtainedtimecode, when the user selects the displayed one or more graphicalelements on the user device graphical user interface 116.

Upon receiving the selection, in various embodiments, the system 102 isconfigured to instruct the user device 104 to receive inputs from theuser for manually adjusting the obtained timecode, via one or moregraphical elements on the user device graphical user interface 116. Forinstance, upon receiving the instructions the user device 104 isconfigured to allow the user to adjust the time code by manuallydragging a playhead over the timeline of the soundtrack, in anaudio/video editor window. The user device 104 is further configured totransmit the adjusted timecodes to the system 102. In an embodiment, thesystem 102 is configured to replace the obtained timecode with themanually adjusted timecode saved in the system memory 126, to reflectthe received adjustments and further to update the associations. Thesystem 102 is further configured to instruct the user device 104 todisplay the adjusted timecodes to the user via the graphical userinterface.

In an exemplary embodiment, the system 102 such as, via the systemprocessor 124 is configured to generate the synchronized presentationmultimedia file having the linked second presentation slide content withthe identified audio sample. In accordance with various embodiments, thegenerated synchronized presentation multimedia file links or associatesthe metadata. of the second presentation slide content with the timecodeof the identified audio sample such that the second presentation slidecontent is displayed when the identified audio sample is played andvice-versa, during the replay of the synchronized presentationmultimedia file. The system 102 is configured to transmit the generatedsynchronized presentation multimedia file to one or more user devices104. Upon receiving the synchronized presentation multimedia. file, theuser device 104 is configured to provide the user with an option, suchas, via the user device graphical user interface 116, to play and/orstore the received synchronized presentation multimedia file. In someembodiments, the user device 104 is also configured to transmit thereceived synchronized presentation multimedia file to other user devices104, such as, via the user device transceiver 106. In some embodiments,the system 102 is configured to store the generated synchronized.presentation multimedia file in a collaborative storage means such as,but not limited to, via cloud storage or a web portal or a social mediaaccount to enable the user to share the generated synchronizedpresentation multimedia. file with other user devices 104.

In accordance with some embodiments, the system 102 is configured toreceive one or more edits to the plurality of presentation slide contentsuch as, via the system transceiver 120. The one or more edits to theplurality of presentation slide content may include, but are not limitedto, editing text of the presentation slide, adding one or morepresentation slides, deleting one or more presentation slides, changingthe predefined order of the plurality of presentation slides, changingthe order of presentation slide content, adding a new presentation slidecontent, deleting a presentation slide content, or the like.

To this end, in accordance with various embodiments, the system 102 isconfigured to determine whether the user chooses to edit the pluralityof presentation slide content. In an exemplary embodiment, the system102 is configured to instruct the user device 104 to display one or moregraphical elements on the user device graphical user interface 116 todetermine whether the user chooses to edit the plurality of presentationslide content. The system 102 is configured to determine that the userchooses to edit, when the user selects the displayed one or moregraphical elements on the user device graphical user interface 116.

Upon receiving the selection, in various embodiments, the system 102 isconfigured to instruct the user device 104 to allow the user to edit thepresentation slide content. For example, the user device 104 isconfigured to allow the user to make one or more edits via thepresentation application. Upon receiving the one or more edits, the userdevice 104 is further configured to transmit the one or morepresentation slide content reflecting the one or more edits, to thesystem 102. In an embodiment, the system 102 is configured to receive asignal from the user device 104 notifying transmission of the one ormore edits to the plurality of presentation slide content from the userdevice 104 to the system 102. Upon receiving the signal, the system 102is configured to obtain the one or more edits from the user device 104.In yet another embodiment, when two or more user devices 104 arecollaborating for creation of the synchronized presentation multimediafile, the system 102 may be configured to receive the one or more editsfrom one or more of the two or more user devices 104.

In an embodiment, upon receiving the one or more edits, the system 102,is configured to replace the presentation slide content with the updatedpresentation slide content. In an exemplary embodiment, the system 102is configured to replace the presentation slide content with the updatedpresentation slide content in the plurality of presentation slidecontent saved in the system memory 126, to reflect the received one ormore edits. The system 102 is further configured to instruct the userdevice 104 to display the updated presentation slide content to theuser.

In an embodiment, the system 102 is configured to transmit one or moreinstructions to the user device 104 to determine whether the userchooses to continue the synchronization process. For example, when theedits are made to the presentation slides that are not yet associatedwith any audio samples (for example, the second presentation slide), theuser may choose to continue or resume the synchronization process. Tothis end, the system 102 is configured to instruct the user device 104to display one or more graphical elements on the user device graphicaluser interface 116 to determine whether the user chooses to continue thesynchronization process. In an exemplary embodiment, the system 102 isconfigured to determine that the user chooses to continue thesynchronization process when the user selects the displayed one or moregraphical elements on the user device graphical user interface 116.

The system 102 is further configured to detect a triggering event toassociate the metadata of the replaced second presentation slide contentwith the timecode of the identified audio sample of the plurality ofaudio samples of the soundtrack. To this end, the system 102 isconfigured to communicate with the user device 104, to detect a furthertriggering event, which is a newly generated triggering event. Forexample, the system 102 is configured to instruct the user device 104,to detect the triggering event received by the user device 104 andcommunicate the detection to the system 102. The system 102 is thenconfigured to identify an audio sample of the soundtrack when thefurther triggering event is detected. The system 102 then, for examplevia the system processor 124, is configured to obtain the timecodeindexed with the identified audio sample of the soundtrack correspondingto the detected triggering event. The system 102 is also configured toobtain the metadata of the replaced second presentation slide contentdisplayed on the user device 104 when the triggering event is detected.In some embodiments, the system 102 may be configured to obtain thetimecode indexed with the identified audio sample and the metadata ofthe replaced second presentation slide content from the system memory126. In accordance with various embodiments, the system 102, forexample, via the system processor 124, is configured to associate thetimecode of the identified audio sample with the metadata of thereplaced second presentation slide content. In some embodiments, thesystem 102 is configured to store the association in the system memory126.

In an embodiment, the system 102 is configured to transmit one or moreinstructions to the user device 104 to determine whether the userchooses to restart the synchronization process. For example, when theedits are made to the presentation slides that are already associatedwith an audio sample, the user may choose to restart the synchronizationprocess. To this end, the system 102 is configured to instruct the userdevice 104 to display one or more graphical elements on the user devicegraphical user interface 116 to determine whether the user chooses torestart the synchronization process. In an exemplary embodiment, thesystem 102 is configured to detect the further triggering event andassociate the metadata of the replaced presentation slide content withthe timecode of the identified audio sample, in accordance with theprocess described above.

In various embodiments, the system 102 is configured to generate asynchronized presentation multimedia file for preview. The system 102 isconfigured to determine whether the user chooses to preview thesynchronized presentation multimedia file. As, the system 102 isconfigured to instruct the user device 104 to display one or moregraphical elements on the user device graphical user interface 116 todetermine whether the user chooses to preview the synchronizedpresentation multimedia file. In an exemplary embodiment, the system 102is configured to determine that the user chooses to preview, when theuser selects the displayed one or more graphical elements on the userdevice graphical user interface 116.

Upon receiving the selection, in various embodiments, the system 102 isconfigured to instruct the user device 104 to play the synchronizedmultimedia presentation file. To this end, the system 102 is configuredto display the first presentation slide and play the soundtrack from thebeginning. The system 102 is further configured to track the timecode ofeach audio sample of the soundtrack, as the soundtrack is being played.The system 102 is configured to detect, for each timecode, if a timecodeof the audio sample is associated with the metadata of any presentationslide content of the plurality of presentation slide content. The system102 is further configured to instruct the user device 104 to display apresentation slide content via user device graphical user interface 116,when the timecode is determined to be associated with the metadata ofthe presentation slide content. The system 102 is configured to repeatthe above-mentioned preview process for each audio sample.

In an exemplary embodiment, when the user chooses to preview from aparticular portion of the synchronized multimedia presentation file, theuser device 104 may allow the user to indicate the desired portion (suchas the audio sample) of the soundtrack for example, but not limited to,by dragging the play head over the timeline of the soundtrack to thedesired portion, or by moving the cursor to the desired portion. Thesystem 102 is further configured to detect the audio sample and theassociated timecode of the audio sample. The associated timecode of thedesired portion is herein referred as a current timecode. The system 102is further configured to determine when the current timecode isassociated with the metadata of any presentation slide content. In oneembodiment, when the system 102 determines that the current timecode isassociated with the metadata of any presentation slide, then the system102 instructs the user device 104 to display the presentation slidecontent with which the current timecode has been associated. Yet inanother embodiment, when the system 102 determines that the currenttimecode is not associated with any presentation slide content, then thesystem 102 instructs the user device 104 to display the presentationslide content with metadata associated with a timecode prior to thecurrent timecode.

FIG. 3 illustrates an exemplary method 200 to synchronize the pluralityof presentation slide content with the soundtrack. Initially, atoperation 202, the method begins with the system 102 obtaining theplurality of presentation slide content arranged in the predefinedorder. In accordance with various embodiments, each presentation slidecontent is associated with the metadata representing a uniqueidentification of the corresponding presentation slide content. Atoperation 204, the system 102 obtains the soundtrack including theplurality of audio samples indexed with corresponding timecodes.Further, at operation 206, the system 102 detects the triggering eventthat identifies an audio sample of the soundtrack as an audio sample totransition from the first presentation slide content to a secondpresentation slide content of the plurality of presentation slidecontent. At operation 208, the system 102 obtains the timecode indexedwith the identified audio sample of the soundtrack corresponding to thedetected triggering event. Further, at operation 210, the system 102associates the obtained timecode with the metadata of the secondpresentation slide content to link the second presentation slide contentwith the identified audio sample. Further, at operation 212, the system102 detects whether there is another triggering event. When the system102 detects the other triggering event at 212, the system 102 returns tooperation 206 and continue with the subsequent operations. At thisstage, as discussed above, the first presentation slide content willcorrespond to a presentation slide at second position and the secondpresentation slide content will correspond to a presentation slide atthe third position. When the system 102 does not detect any triggeringevent at 212, the system 102 proceeds to generate the synchronizedpresentation multimedia file having each of the presentation slidecontents linked with the respective identified audio sample at 214.

Further FIG. 4 illustrates an exemplary method 300 for editing apresentation multimedia file including a plurality of presentation slidecontent, in accordance with some embodiments. The method begins atoperation 302 by the system 102 receiving one or more edits to apresentation slide content of the plurality of presentation slidecontent. At operation 304, the system 102 updates the presentation slideto reflect the received one or more edits. Further, at operation 306,the system 102 replaces the corresponding presentation slide contentwith the updated presentation slide content. In some embodiments, thesystem 102 receives a further triggering event to associate the metadataof the replaced presentation slide content with the timecode of theidentified audio sample of the plurality of audio samples of thesoundtrack. At operation 308, the system 102 associates the metadata ofthe replaced presentation slide content with the timecode of theidentified audio sample. Further, at operation 310, the system 102detects whether more edits are received for another presentation slidecontent. When the system 102 detects that more edits have been receivedat 310, the system 102 returns to operation 304 and continue with thesubsequent operations. The system 102 keeps detecting until no furtheredits are received for the presentation slide contents of the pluralityof presentation slide content.

FIGS. 5 through 9 illustrate exemplary implementations for synchronizingthe plurality of presentation slide content with the soundtrack, inaccordance with some embodiments. To this end, FIGS. 4 through 8illustrate the user device graphical user interface 116 that isdisplayed on the user device display 114 of the user devices 104. Asdiscussed hereinbefore, the system 102 is configured to communicate withthe user via the user device graphical user interface 116.

FIG. 5 illustrates the user device graphical user interface 116, forsynchronizing the plurality of presentation slide content with thesoundtrack when the triggering event may include one of a mouse click, akey press, a screen touch, or the like, while the soundtrack is beingplayed. The system 102 allows the user to choose this method via thegraphical element, such as, sync with button 514. During thesynchronizing process, the system 102 displays the plurality ofpresentation slide content, such as, the first presentation slidecontent 402 in a current slide window 406 and the second presentationslide content 404 in a next slide window 408. The system 102 furtherreceives a selection of the soundtrack, such as the soundtrack 410, viaone or more graphical elements 412 and upon reception, displays thesoundtrack 410 such as, in a window 414 along with the timecode. Thesystem 102 enables the user to track or move to a particular timecode ofthe soundtrack using the play head 422. The system 102 associates atimecode of an audio sample being played with the metadata of the secondpresentation slide content 404, when the soundtrack is played via thegraphical element 416. The system 102 displays the associations of thetimecode and the presentation slide content in a control window 418. Inan embodiment, the system 102 also adds additional music, such as butnot limited to, music A 420-1 and music B 420-2 to the soundtrack 410.

As illustrated in FIG. 6, the system 102 displays a plurality ofgraphical elements, such as but not limited to, a import text file 502,a record 504, a add marks 508, a re-record 512, and so on, to record thesoundtrack and add one or more textual marks in the transcribed file ofthe soundtrack. In accordance with various embodiments, the system 102imports the text file comprising the narration for recording thesoundtrack, upon selection of the graphical element, import text file502. In an exemplary embodiment, the system 102 instructs the userdevice 104 to display the imported text file via graphical element, liketeleprompter 516. Further, the system 102 allows the user to record thesoundtrack upon selection of the graphical element, such as, record 504.Further, the system 102 re-records the soundtrack upon selection of thegraphical element, such as, re-record 512. The system 102 allows theuser to edit the transcribed text file upon selection of the graphicalelement, such as, the edit text 506.

In accordance with various embodiments, the system 102 allows the userto add one or more textual marks in the transcribed text file uponselection of the graphical element, such as add marks 508, asillustrated in FIG. 7. Further, in some embodiments, the system 102superimposes the transcribed text file over the waveform of thesoundtrack, upon selection of the graphical element, such as, export totimeline 510, as illustrated in FIG. 8. In accordance with variousembodiments, after the one or more textual marks are added in thetranscribed text file, the system 102 associates the timecodecorresponding to the textual marks with the metadata of presentationslide, upon selection of the graphical element, such as, cue 602, asillustrated in FIG. 9. The system 102 further displays the associationin the control window 418, as illustrated in FIG. 9.

The system and the method of the present disclosure provide a simple andan efficient solution to synchronize the plurality of presentation slidecontent with the soundtrack to create the synchronized multimedia file.This disclosure provides different embodiments to allow easy andhassle-free generation of the synchronized multimedia file. By simplyadding one or more textual marks over the words, the system allows theuser to select exact location to trigger a change, thereby providinghigh resolution and accuracy to slide and/or animation changes. Also,synchronizing the presentation slide content by way of simple clickprovides an accurate and efficient way of synchronizing, as it allowsthe user to capture the exact desired audio sample to link with thepresentation slide content in real time.

Moreover, the present disclosure provides a quick and faster way ofsynchronizing the presentation slides with the soundtrack. For example,with the embodiments of the present disclosure, a five-minute video witha hundred slides and as many animations can be synchronized in just fewminutes. On the other hand, the synchronization process usingconventional video editing systems takes hours or even days to performthe same task, due to the complexity of the conventional video editingsystems. In addition, the present disclosure provides flexibility to theuser to edit the plurality of presentation slide content even during thesynchronization process. The user can easily make any change in thepresentation slide content and can resume the synchronization processform where the synchronization process was paused, which cannot beachieved in the conventional editing systems.

In the hereinbefore specification, specific embodiments have beendescribed. However, one of ordinary skill in the art appreciates thatvarious modifications and changes can be made without departing from thescope of the invention as set forth in the claims below. Accordingly,the specification and figures are to be regarded in an illustrativerather than a restrictive sense, and all such modifications are intendedto be included within the scope of present teachings. The benefits,advantages, solutions to problems, and any element(s) that may cause anybenefit, advantage, or solution to occur or become more pronounced arenot to be construed as a critical, required, or essential feature orelements of any or all the claims. The invention is defined solely bythe appended claims including any amendments made during the pendency ofthis application and all equivalents of those claims as issued.

Moreover, in this document, relational terms such as first and second,top and bottom, and the like may be used solely to distinguish oneentity or action from another entity or action without necessarilyrequiring or implying any actual such relationship or order between suchentities or actions. The terms “comprises,” “comprising,” “has”,“having,” “includes”, “including,” “contains”, “containing” or any othervariation thereof, are intended to cover a non-exclusive inclusion, suchthat a process, method, article, or apparatus that comprises, has,includes, contains a list of elements does not include only thoseelements but may include other elements not expressly listed or inherentto such process, method, article, or apparatus. An element preceded by“comprises . . . a”, “has . . . a”, “includes . . . a”, “contains . . .a” does not, without more constraints, preclude the existence ofadditional identical elements in the process, method, article, orapparatus that comprises, has, includes, contains the element. The terms“a” and “an” are defined as one or more unless explicitly statedotherwise herein. The terms “substantially”, “essentially”,“approximately”, “about” or any other version thereof, are defined asbeing close to as understood by one of ordinary skill in the art, and inone non-limiting embodiment the term is defined to be within 10%, inanother embodiment within 5%, in another embodiment within 1% and inanother embodiment within 0.5%. The term “coupled” as used herein isdefined as connected, although not necessarily directly and notnecessarily mechanically. A device or structure that is “configured” ina certain way is configured in at least that way but may also beconfigured in ways that are not listed.

It will be appreciated that some embodiments may be comprised of one ormore generic or specialized processors (or “processing devices”) such asmicroprocessors, digital signal processors, customized processors andfield programmable gate arrays (FPGAs) and unique stored programinstructions (including both software and firmware) that control the oneor more processors to implement, in conjunction with certainnon-processor circuits, some, most, or all of the functions of themethod and/or apparatus described herein. Alternatively, some or allfunctions could be implemented by a state machine that has no storedprogram instructions, or in one or more application specific integratedcircuits (ASICs), in which each function or some combinations of certainof the functions are implemented as custom logic. Of course, acombination of the two approaches could be used.

Moreover, an embodiment can be implemented as a computer-readablestorage medium having computer readable code stored thereon forprogramming a computer (example, comprising a processor) to perform amethod as described and claimed herein. Examples of suchcomputer-readable storage mediums include, but are not limited to, ahard disk, a CD-ROM, an optical storage device, a magnetic storagedevice, a ROM (Read Only Memory), a PROM (Programmable Read OnlyMemory), an EPROM (Erasable Programmable Read Only Memory), an EEPROM(Electrically Erasable Programmable Read Only Memory) and a Flashmemory. Further, it is expected that one of ordinary skill,notwithstanding possibly significant effort and many design choicesmotivated by, for example, available time, current technology, andeconomic considerations, when guided by the concepts and principlesdisclosed herein will be readily capable of generating such softwareinstructions and programs and ICs with minimal experimentation.

1. A method for synchronizing a plurality of presentation slide contentin a presentation slide with a soundtrack, the method comprising:obtaining the plurality of presentation slide content arranged in apredefined order in the presentation slide, each presentation slidecontent associated with a metadata representing unique identification ofthe corresponding presentation slide content, wherein the presentationslide content comprises a video in the presentation slide or ananimation in the presentation slide; obtaining the soundtrack includinga plurality of audio samples, each audio sample being indexed with acorresponding timecode detecting a triggering event identifying an audiosample from the plurality of audio samples of the soundtrack totransition from a first presentation slide content to a secondpresentation slide content of the plurality of presentation slidecontent in the presentation slide; obtaining a timecode indexed with theidentified audio sample of e soundtrack corresponding to the detectedtriggering event; associating the obtained timecode with the metadata ofthe second presentation slide content in the presentation slide to linkthe second presentation slide content in the presentation slide with theidentified audio sample; and. generating a synchronized presentationmultimedia file having the linked second presentation slide content inthe presentation slide with the identified audio sample.
 2. The methodof claim 1, wherein the triggering event includes one of a mouse click,a key press, a screen touch, or a textual mark in transcribed text fileof the soundtrack.
 3. The method of claim 1, wherein the triggeringevent is detected in real time while the soundtrack is being played toidentify the audio sample from the plurality of audio samples of thesoundtrack to transition from the first presentation slide content tothe second presentation slide content in the presentation slide.
 4. Themethod of claim 1, further comprising: transcribing the soundtrack intoa text file, wherein detecting the triggering event comprises detectinga textual mark in the text file, the textual mark being added inproximity to a word associated with the audio sample to transition fromthe first presentation slide content to the second presentation slidecontent in the presentation slide.
 5. The method of claim 4, whereinobtaining a timecode indexed with the identified audio sample of thesoundtrack corresponding to the detected triggering event furthercomprises: associating the transcribed text file with the timecodeassociated with the corresponding audio samples; and obtaining thetimecode associated with the detected textual mark in the text filebased on the association.
 6. The method of claim 4, further comprising:receiving input indicating edits to the transcribed text file associatedwith the soundtrack; and updating the corresponding soundtrack based onthe received input.
 7. The method of claim 1, further comprising:receiving input to generate a synchronized presentation multimedia filefor preview; generating the synchronized presentation multimedia filefor preview by: detecting a current timecode associated with thesoundtrack being played; determining whether the current timecode isassociated with metadata of any presentation slide content of theplurality of presentation slide content in the presentation slide; anddisplaying the presentation slide content on a display when the currenttimecode is associated with metadata of a presentation slide content inthe presentation slide.
 8. The method of claim 1, further comprising:receiving input to manually adjust the obtained timecode; and updatingthe obtained timecode to reflect the manually adjusted timecode.
 9. Themethod of claim 1, further comprising: receiving one or more edits tothe second presentation slide content of the plurality of presentationslide content in the presentation slide; updating the secondpresentation slide content in the presentation slide to reflect thereceived one or more edits; replacing the corresponding secondpresentation slide content in the obtained plurality of presentationslide content with the updated second presentation slide content;detecting a further triggering event to associate the metadata of thereplaced second presentation slide content with the timecode of theidentified audio sample of the plurality of audio samples of thesoundtrack; and associating the metadata of the replaced secondpresentation slide content in the presentation slide with the timecodeof the identified audio sample.
 10. A system for synchronizing aplurality of presentation Aide content in a presentation slide with asoundtrack, the system comprising: a memory; and a processorcommunicatively coupled to the memory, wherein the memory storesinstructions executable by the processor, and wherein upon execution ofthe stored instructions the processor is configured to: obtain theplurality of presentation slide content arranged in a predefined. orderin the presentation slide, each presentation slide content associatedwith a metadata representing unique identification of the correspondingpresentation slide content, wherein the presentation slide contentcomprises a video in the presentation slide or an animation in thepresentation slide; obtain the soundtrack including a plurality of audiosamples, each audio sample being indexed with a corresponding timecode;detect a triggering event identifying an audio sample from the pluralityof audio samples of the soundtrack to transition from a firstpresentation slide content to a second presentation slide content of theplurality of presentation slide content in the presentation slide;obtain a timecode indexed with the identified audio sample of thesoundtrack corresponding to the detected triggering event; associate theobtained timecode with the metadata of the second presentation slidecontent in the presentation slide to link the second presentation slidecontent in the presentation slide with the identified audio sample; andgenerate a synchronized presentation multimedia file having the linkedsecond presentation slide content in the presentation slide with theidentified audio sample.
 11. The system of claim 10, wherein thetriggering event includes one of a mouse click, a key press, a screentouch, or a textual mark in transcribed text file of the soundtrack. 12.The system of claim 10, wherein the processor is configured to detectthe triggering event in real time while the soundtrack is being playedto identify the audio sample from the plurality of audio samples of thesoundtrack to transition from the first presentation slide content tothe second presentation slide content in the presentation slide.
 13. Thesystem of claim 10, wherein the processor is further configured to:transcribe the soundtrack into a text file, wherein the processordetects the triggering event by detecting a textual mark in the textfile, the textual mark being added in proximity to a word associatedwith the audio sample to transition from the first presentation slidecontent to the second presentation slide content in the presentationslide.
 14. The system of claim 13, wherein the processor is configuredto obtain a timecode indexed with the identified audio sample of thesoundtrack corresponding to the detected triggering event by:associating the transcribed text file with the timecode associated withthe corresponding audio samples; and obtaining the timecode associatedwith the detected textual mark in the text file based on theassociation.
 15. The system of claim 13, further comprising: a userinterface communicatively coupled to the processor, wherein the userinterface is configured to receive input indicating edits to thetranscribed text file associated with the soundtrack, wherein theprocessor is further configured to update the corresponding soundtrackbased on the received input.
 16. The system of claim 10, furthercomprising: a display; and a user interface, the display and the userinterface communicatively coupled to the processor, wherein the userinterface is configured to receive input to generate a synchronizedpresentation multimedia file for preview, wherein the processor isfurther configured to generate the synchronized presentation multimediafile for preview by: detecting a current timecode associated with thesoundtrack being played; determining whether the current timecode isassociated with metadata of any presentation slide content of theplurality of presentation slide content in the presentation slide; anddisplaying the presentation slide content on the display when thecurrent timecode is associated with metadata of a presentation slidecontent in the presentation slide.
 17. The system of claim 10, furthercomprising: a user interface communicatively coupled to the processor,wherein the user interface is configured to receive input to manuallyadjust the obtained timecode, wherein the processor is furtherconfigured to update the obtained timecode to reflect the manuallyadjusted timecode.
 18. The system of claim 10, further comprising: auser interface communicatively coupled to the processor, wherein theuser interface is configured to receive one or more edits to the secondpresentation slide content of the plurality of presentation slidecontent in the presentation slide, wherein the processor is furtherconfigured to: update the second presentation slide content in thepresentation slide to reflect the received one or more edits; replacethe corresponding second presentation slide content in the obtainedplurality of presentation slide content with the updated secondpresentation slide content; detect a further triggering event toassociate the metadata of the replaced second presentation slide contentwith the timecode of the identified audio sample of the plurality ofaudio samples of the soundtrack; and associate the metadata of thereplaced second presentation slide content in the presentation slidewith the timecode of the identified audio sample.
 19. A method forsynchronizing a plurality of presentation slide content with asoundtrack, the method comprising: obtaining the plurality ofpresentation slide content arranged in a predefined order, eachpresentation slide content associated with a metadata representingunique identification of the corresponding presentation slide content;obtaining the soundtrack including a plurality of audio samples, eachaudio sample being indexed with a corresponding timecode; detecting atriggering event identifying an audio sample from the plurality of audiosamples of the soundtrack to transition from a first presentation slidecontent to a second presentation slide content of the plurality ofpresentation slide content; obtaining a timecode indexed with theidentified audio sample of the soundtrack corresponding to the detectedtriggering event; associating the obtained timecode with the metadata ofthe second presentation slide content to link the second presentationslide content with the identified audio sample; generating asynchronized presentation multimedia file having the linked secondpresentation slide content with the identified audio sample; receivingone or more edits to the second presentation slide content of theplurality of presentation slide content; updating the secondpresentation slide content to reflect the received one or more edits;replacing the corresponding second presentation slide content in theobtained plurality of presentation slide content with the updated secondpresentation slide content; detecting a further triggering event toassociate the metadata. of the replaced second presentation slidecontent with the timecode of the identified audio sample of theplurality of audio samples of the soundtrack; and associating themetadata of the replaced second presentation slide content with thetimecode of the identified audio sample.
 20. The method of claim 19,wherein the presentation slide content comprises one or more of apresentation slide, a video in the presentation slide, or an animationin the presentation slide.